04:00
2026-06-15
arxiv.org
large-language-models
DLawBench: Evaluating LLMs Through Multi-Turn Legal Consultation
Researchers introduced DLawBench, a benchmark for evaluating large language models on multi-turn legal consultation tasks. The benchmark includes 461 cases from Chinese and U.S. law and tests models oโฆ